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ABSTRACT 

Compensatory mutations in RNA are generally 
regarded as those that maintain base pairing, and 
their identification forms the basis of phylogenetic 
predictions of RNA secondary structure. However, 
other types of compensatory mutations can 
provide higher-order structural and evolutionary in- 
formation. Here, we present a helix-length compen- 
sation study for investigating structure-function 
relationships in RNA. The approach is demonstrated 
for stem-loop I and stem-loop V of the Neurospora 
VS ribozyme, which form a kissing-loop interaction 
important for substrate recognition. To rapidly char- 
acterize the substrate specificity {k cat /K M ) of several 
substrate/ribozyme pairs, a procedure was estab- 
lished for simultaneous kinetic characterization of 
multiple substrates. Several active substrate/ 
ribozyme pairs were identified, indicating the 
presence of limited substrate promiscuity for stem 
lb variants and helix-length compensation between 
stems lb and V. 3D models of the l/V interaction 
were generated that are compatible with the 
kinetic data. These models further illustrate the 
adaptability of the VS ribozyme architecture for sub- 
strate cleavage and provide global structural infor- 
mation on the l/V kissing-loop interaction. By 
exploring higher-order compensatory mutations in 
RNA our approach brings a deeper understanding 
of the adaptability of RNA structure, while opening 
new avenues for RNA research. 

INTRODUCTION 

Compensatory mutations play an important role in the 
evolution of macromolecular systems. Their identification 



has proven to be important for the characterization of 
macromolecular structure and for revealing important 
aspects of function. Studies of compensatory mutations 
in functional RNA molecules are particularly informative, 
because of the simple base-pairing rules that govern the 
formation of RNA helical structures. Compensatory mu- 
tations in RNA are therefore generally seen as those that 
maintain base pairing interactions, and their identification 
has been central for comparative sequence analyses in the 
prediction of RNA secondary structures and for phylo- 
genetic analyses. 

Compensatory mutations in RNA are not strictly 
limited to base-pairing interactions. Other types of struc- 
tural compensations, such as helix-length compensation, 
can help provide higher-order structural and evolutionary 
information. The concept of helix-length compensation 
was first introduced in the prediction of coaxially 
stacked helices for rRNAs (1). It was suggested that 
since coaxially stacked helices are expected to maintain a 
constant overall length in a functional RNA architecture, 
comparative sequence analysis could support coaxially 
stacked helices when one helix in a group of organisms 
is shorter, while its partner helix in that group is longer by 
the same number of base pairs (1). In support of this 
theory, comparative sequence analysis previously pre- 
dicted the existence of coaxially stacked helices in 23S 
rRNA [helices 2646-2652/2668-2674 and 2675-2680/ 
2727-2732; (2)], which was subsequently confirmed in 
the X-ray structure of the 50S rRNA subunit (3). 
Coaxial stacking of the group I intron PI and P2 helices 
were also previously inferred from helix-length variations 
(4) and later confirmed from crystal structures (5,6). A 
more elaborate investigation of tertiary structure compen- 
sation has been previously reported as part of a compara- 
tive sequence analysis of mitochondrial tRNA sequences 
that do not fit the standard cloverleaf consensus (7). 3D 
modeling helped describe several types of structural com- 
pensations, including helix-length compensations, that 
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can help maintain the global L-shape in these atypical 
tRNAs (7). 

In this work, we extend the concept of helix-length com- 
pensation for structure-function studies of RNA using the 
Neurospora Varkud satellite (VS) ribozyme as a model 
system [for reviews, see (8-10); Figure 1A]. Like other 
nucleolytic ribozymes, the VS ribozyme adopts a simple 
RNA architecture that allows phosphodiester cleavage 
activity. Notably, the VS ribozyme has the unique 
ability to specifically recognize its stem-loop I (SLI) sub- 
strate through formation of a kissing-loop interaction 
with stem-loop V [SLV; (11-13)]. This I/V kissing-loop 
interaction of the VS ribozyme is ideally suited for per- 
forming helix-length compensation studies since variant 
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Figure 1. Primary and secondary structures of the substrate/ribozyme 
(S/R) pairs characterized here. (A) Parental VS ribozyme (R 0 ) and sub- 
strate (S 0 ). Substrate recognition involves a kissing-loop interaction 
between residues 630-632 of So and 697-699 of R 0 . The arrowhead 
points to the cleavage site and the dashed box encloses the region 
under investigation. (B) Variant S/R pairs. Only the mutated regions 
are shown along with the nomenclature of the mutated substrates and 
ribozymes. 



substrates and ribozymes can be synthesized separately 
and then mixed in various combinations for kinetic char- 
acterization (Figure IB). Furthermore, in the 3D models 
of the VS ribozyme, helices I and V are extended away 
from each other in a similar manner as coaxially stacked 
helices (14-18). It is expected that adding (or removing) 
base pairs within stem lb would disrupt productive 
docking of the SLI cleavage loop with the A730 loop of 
stem-loop VI to form the proposed active site (10,19-27) 
and significantly reduce cleavage activity, as previously 
observed for stem V length variants (16,28). We hypothe- 
size that cleavage activity of stem lb length variants could 
be restored by adjusting the number of base pairs within 
stem V of the ribozyme, such that the total number of base 
pairs within stems lb and V is preserved (Figure IB). To 
explore this possibility, we combined kinetic characteriza- 
tion of variants and 3D modeling. Our helix- 
compensation study with the VS ribozyme provides 
global structural information and reveals the adaptability 
of RNA architecture. 



MATERIALS AND METHODS 

Plasmids 

The pAvapl plasmid used for transcription of Avapl 
(Figure 1A) was constructed by PCR from the A-3 clone 
(18,29). The plasmids for transcription of Avapl ribozyme 
variants (Figure IB) were prepared by QuikChangell 
site-directed mutagenesis from pAvapl (Stratagene, CA). 
All plasmids were validated by DNA sequencing. 

Preparation of RNA for kinetic studies 

The R Q ribozyme and variant sequences (Figure 1) were 
prepared by in vitro T7 RNA polymerase transcription 
from the pAvapl plasmid (or derivatives) linearized at 
the Aval site. All ribozymes were purified as previously 
described (18). The S 0 substrate and variant sequences 
(Figure 1) were synthesized in vitro using T7 RNA poly- 
merase and a single-stranded synthetic oligonucleotide 
template (Integrated DNA Technologies, IA). They were 
purified by denaturing gel electrophoresis, dephos- 
phorylated with calf alkaline phosphatase and 
5'-end-labeled with y-( 32 P) ATP (PerkinElmer, Ont.) 
using established procedures (18). 

Single-substrate kinetic studies 

All reactions were carried out at 30°C in reaction buffer 
(50 mM Tris pH 7.5, 25 mM KC1, 2mM spermidine and 
25 mM MgCl 2 ). Prior to starting the cleavage reaction, the 
5'- 32 P-labeled substrate was heated to 95°C for 1 min in 
storage buffer (lOmM Tris pH 7.6 and 1 mM EDTA) and 
snap-cooled to promote hairpin formation. Cleavage re- 
actions were initiated by addition of equal volumes (20 ul) 
of the 5'- 32 P-labeled S and R, either the wild-type R (R Q ) 
or variants (Figure IB), after a 5-min pre-incubation of S 
and R at 30°C in reaction buffer. Typically, about 10 
aliquots of 3 (il were removed from the 40-ul reaction 
mixture at specific times, and the reaction was stopped 
by addition of 12 ui of stop mix (50mM EDTA pH 8.0, 
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0.04% xylene cyanol, 0.04% bromophenol blue and 
>85% formamide) and transfer to 4°C. Samples were 
analyzed by denaturing gel electrophoresis [20% acryl- 
amide:bisacrylamide (19:1) / 7M urea gels]. For each 
time point, the amounts of substrate and product were 
quantified using a Molecular Imager FX and the 
Quantity One ID analysis software version 4.6.2 
(Bio-Rad Laboratories, Ont.). 

Single-turnover kinetic experiments were typically per- 
formed with [S] = 250 pM and excess R (0.005-5 uM). The 
percentage of remaining substrate [F = 5 t /(5 t +P t )*100%; 
where S t and P t are the amount of substrate and product 
at time t] was plotted against time. The resulting time 
courses were fitted to the equation F = Ae _( - s * l) + 
F^, where A is the amplitude, k obs is the rate of the 
reaction and F^ is a variable that represents the amount 
of substrate at the end point of the cleavage reaction. The 
value of k obs , the standard error for & obs , and F^ were 
estimated by non-linear regression analysis within the 
OriginPro 8.1 software (OriginLab, MA). The F^ values 
are <10% for all S/R combinations. 

The dependence of k ohs on [R] was analyzed by linear 
regression to derive the second order rate constant of the 
reaction, k cat /K M . The linear dependence of k ohs on [R] 
was observed from 0.005 to 1 uM for S Q /R Q and from 
0.005 to 3uM for both S+ lhp /R 0 and S_i hp /R 0 . The 
quality of the fit for the linear dependence of the k obs 
on [R] was obtained from the square of the correlation 
coefficient (R ), and in all cases R J was > 0.99. 
Single-substrate kinetic experiments were all performed 
from at least three independent experiments, and 
reported k C!lt /K M and their errors are average values 
and standard deviations, respectively, from these 
multiple measurements. 

Multiple-substrate kinetic studies 

Multiple-substrate kinetic studies were carried out as 
single-turnover reactions under the same conditions as 
the single-substrate kinetic studies. The main difference 
is that the cleavage reactions were initiated by addition 
of equal volumes (20 ul) of R (0.005-5 uM of either R Q 
or a variant) and a mixture of 5'- 32 P-labeled nucleic 
acids containing all substrates (5"_ 2 bp, S-ibp, S +lhp , 
5 , +2b P , 5 +3 bp and S +4bp ; 35 pM of each) and, in certain 
cases, a non-cleavable DNA hairpin control (5'- CGG 
CGA AGC ATC GTC GGC TTC GCC G -3'; 35 pM). 
In addition, the percentage of remaining substrate was 
expressed as F = (5 , t /5o)*(Co/C t )*100% and plotted 
against time, where 5 0 and C 0 are respectively the 
amount of substrate and non-cleavable S_ 2 b P control 
at time t = 0 and S t and C t represent the correspond- 
ing values at time t^0. Values for the reaction rate 
k 0 bs, amount of uncleaved substrate F^ and specificity 
constant k cat /K M were derived as for the single- 
substrate kinetic studies. All multiple-substrate kinetic 
experiments were performed at least three times, and 
reported k cat /K M and their errors are average values and 
standard deviations, respectively, from these multiple 
measurements. 



Localization of the substrate cleavage site 

The site of cleavage was determined for all S/R combin- 
ations where cleavage was observed. For this purpose, the 
substrates were subjected to cleavage in single-substrate 
kinetic conditions chosen to yield ~50% cleavage. 
Cleavage reactions were analyzed by denaturing gel elec- 
trophoresis [20% acrylamide:bisacrylamide (19:1) / 7M 
urea gels] and by comparison with partial Tl nuclease 
digest and alkaline hydrolysis of S 0 . 

Native gel studies of RNA substrates 

Non-denaturing polyacrylamide gel electrophoresis was 
carried out on a 7.5% polyacrylamide bisacrylamide 
(37.5:1) gel in TB buffer (50 mM Tris borate pH 8.0 and 
5mM magnesium acetate). Prior to loading, the RNA 
substrates were heated at 95°C for 2min in storage 
buffer and transferred on ice for 5min. The gel was 
pre-run for 30min at 200 V, run for 2h at 250 V and 
4°C and then stained with Stains All (Sigma-Aldrich, 
Ont.). 

Structural modeling of SLI/SLV complexes 

3D models of the SLI/SLV complex were generated using 
a stepwise procedure (Supplementary Figure S3). As a first 
step, RNA fragments were modeled using the constraint- 
satisfaction software MC-Sym 3.3.2 (30). Several models 
of the I/V kissing-loop junction core (residues 626-634 
and 695-701) were first obtained separately using the 
NMR structure of magnesium-bound SLV (31) and con- 
straints derived from biochemical studies, as previously 
described (18). A unique structural model was obtained 
for each type of SLI stems (S-ibp to S +4 b p ; residues 
619-625 and 635-640 for So), in accordance with the sec- 
ondary structure model (Figure 1) and the NMR structure 
of the active conformation of the cleavage site internal 
loop (32). A unique structural model was also obtained 
for each type of SLV stems (R-4b p to R+i bp ; residues 689- 
694 and 702-707 for R Q ), in accordance with the secondary 
structure model (Figure 1). Before being grafted together, 
the core and stem models were energy minimized using the 
molecular dynamics software Sander (33) based on the 
Amber force field and including covalent, electrostatic, 
Van der Waals energies, as well as implicit solvation in 
the form of a Generalized Born model (34). Several rounds 
of minimization (100000) were first applied including re- 
straints to maintain Watson-Crick base pairing and 
followed by fewer rounds of minimization (1000) 
without these restraints. The minimized core and stem 
models were then filtered using a custom C++ program 
based on the freely accessible C++ library MC-Core (30). 
The filter-selected models that preserved global structural 
integrity (heavy-atom rmsd between the pre- and 
post-minimization model <2.0A) and key structural 
features (covalent integrity of backbone, Watson-Crick 
base pairs at the I/V kissing-loop junction, and 
hydrogen bonds typical of U-turn structure in SLI 
between U628 11-0 and G630 N7 and between U628 N3 
and U631). Each remaining core model was then 
assembled using MC-Sym with all possible combinations 
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of stem I (5_ib P to S+ 4 b P ) and stem V (R-4b P to R+ib P ), 
energy minimized and filtered to create 36 sub-models that 
form a common-core group. 

For all sub-models, helical axes were computed using 
Curves+ for stem lb and V (35). The interhelical angle 
(#i_v) and axial displacement (g?i_v) between these two 
axes were computed by vector methods within Pymol 
using Python scripts. For calculation of ^i_v, a reference 
point O was first set midway between atoms correspond- 
ing to A698 N3 and U631 Nl of the kissing-loop inter- 
action. Two segments OA and OB were calculated that 
start at this reference point to reach the two helical axes 
at point A and B in a perpendicular orientation. The 
distance of the AB segment was taken to represent the 
axial displacement. The reported values of f? T _v and rf T _v 
are average values of all sub-models from a common-core 
group. 

Selection of SLI/SLV models compatible with kinetic data 

In order to select common-core groups that are compat- 
ible with the kinetic data, the sub-models of each 
common-core group were first compared to their reference 
Sq/Rq sub-model using an rmsd criteria. The sub-models 
were first superposed to the S Q /R 0 sub-model based on 
alignment of the first base pair of SLV (residues 689 and 
707 in S 0 /R 0 ). Heavy atoms pairwise rmsds between the 
SLI cleavage site region of the S Q /R 0 sub-model (residues 
619-623 and 637-640) and that of each sub-models were 
then computed. For each common-core group, the rmsd 
distributions for the high-activity [(A' cat /^T M )/(^cat/ 
^ M )>0.25)] and low-activity [(k cli tlK M )/(k c J 
-^m)<0.01)] classes of S/R pairs were compared using a 
Welch's /-test (36). The normality assumption was verified 
using the Shapiro-Wilks test (37) on each distribution, and 
no significant violation was found (a = 0.05). Common- 
core groups compatible with kinetic data were selected 
when statistically significant differences (at a = 0.02) 
were observed between the mean rmsd values of 
high-activity and low-activity classes of S/R pairs. 



RESULTS 

For helix-length compensation studies, we performed 
kinetic studies of trans cleavage with the Avapl VS 
ribozyme and an SLI substrate, herein termed R 0 and 
So, respectively (Figure 1A). The S 0 substrate adopts a 
pre-shifted active state (Figure 1A) that is rapidly 
cleaved by the VS ribozyme (18). Helix-length mutations 
were introduced within stem lb of So (Figure IB) that 
remove up to 2 bp (S_ib p and S_ 2 b P ) or add up to 4 bp 
(S+ibp, S +2 bp, S +3 bp and S +4 b p ). Helix-length mutations 
were also introduced within stem V of R 0 (Figure 1 B) that 
add up to 2 bp (R+\b P and R+2b P ) ° r remove up to 4 bp 
(i?_ib P , R-2b P , ^-3b P and ^_ 4 b p )- Compensatory sub- 
strate/ribozyme (S/R) variant pairs are those for which 
the total number of base pairs in stems lb and V is iden- 
tical to the S 0 /R 0 reference pair (Figure IB). We hypothe- 
size that, in the simplest scenario, compensatory S/R 
variant pairs would retain cleavage activity, whereas 
other S/R variant pairs would not. 



Single-substrate kinetic studies 

We first examined the specificity of R Q for each substrate 
individually by determining the second-order rate 
constant (k cat /K M ) under single-turnover conditions (18), 
as illustrated for S 0 in Figure 2. Cleavage reactions were 
performed with 32 P-labeled S 0 and excess R 0 and moni- 
tored by denaturing gel electrophoresis (Figure 2A). The 
percentage of remaining substrate was plotted against time 
to derive the value of the first-order rate constant k ohs 
(Figure 2B). For S_ib P , S 0 and S+ib p a linear relationship 
is observed when k ohs is plotted as a function of R 0 con- 
centration, and the values of catalytic efficiency (k CiLt /K M ) 
were obtained from the slope of this graph (Figure 2C and 
Table 1). For all other substrates (S_ 2 b P ^ S +2 b P , S +3 b p 
and S+4b p ), the value of k ohs is below the detection 
limit (< 0.001 min -1 ) at R Q concentrations up to 5uM, 
and therefore the value of k oat /K M cannot be 
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Figure 2. Single-substrate kinetic analysis of R 0 . (A) Cleavage of 
5'- 32 P-labeled So by R 0 (at 37.5 nM) monitored by denaturing gel elec- 
trophoresis. (B) The percentage of remaining substrate was plotted 
against time. The data were fitted to the non-linear equation 
F=Ae^ kobs *° + F^, to extract the value of k obs (k obs = 0.332 ± 
0.008 min -1 ; .ft 2 =1.00). (C) Cleavage reactions were performed at 
various concentrations of R 0 , and the value of k ohs was plotted 
against ribozyme concentration. For all fc D bs values, error bars are 
smaller than the data point on the graph. The data were fitted to a 
single exponential equation, and the k CM /K M is extracted by linear re- 
gression (k Ci ,i/K M = value of the slope = 1 1.2 min -1 uM -1 and 
R 2 = 0.99). 
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determined (Table 1). To insure that the lack of reactivity 
of certain substrates is not due to major folding defects, 
they were analyzed by native gel electrophoresis 
(Supplementary Figure SI). All substrates, with exception 
of 5_ 2 bp, migrate as a single band on the gel, at a position 
typical of small RNA hairpins, in agreement with the pre- 
dicted secondary structure (38). Most of the S_ 2 b P sub- 
strate migrates slower, indicating that it adopts a duplex 
conformation that is incompatible with cleavage by i? 0 - 

For R 0 , the highest value of k cat /K M is obtained with 
So (11 ± 1 min _1 uM _1 ). Surprisingly, the k cat /K M 
values for S_, (5.2 ± 0.8 min" 1 ^" 1 ) and S +1 
(2.7 ± 0.3min _1 uM _1 ) are reduced by only 2- and 
4-fold, respectively, with respect to S Q (Table 1). The 
similar substrate specificities of R 0 for S 0 and S+\ 
point to the adaptability of the VS ribozyme architecture 
to cleave a range of substrates with 3-5 bp in stem lb, 
indicating a certain level of promiscuity in substrate 
selection. 

Multiple-substrate kinetic studies 

Single-substrate kinetic studies performed with R 0 are 
fairly tedious and time-consuming, especially given that 
several time-courses are needed to derive k cat /K M values 
for an individual substrate, each k cat /K M are determined at 
least three times, and multiple substrates are being 
investigated. To accelerate the kinetic characterization of 
the remaining S/R variant pairs (Figure IB), we set up a 



Table 1. Summary of kinetic data for 


Ro' 




Values of 


t/K M (min-'ulVr 1 ) 




Single-substrate kinetics 


Multiple-substrate kinetics 


S-2bp 


n.d." 


n.d. 


S-t bp 


5.2 ± 0.8 


5.9 ± 0.7 


So 


11 ± 1 


11 ± 2 


S+i bp 


2.7 ± 0.3 


2.7 ± 0.4 


S+2bp 


n.d. 


n.d. 


S+3bp 


n.d. 


n.d. 


S+4bp 


n.d. 


n.d. 


"The k c:ll /K M and their errors (in min 


'uM~') are average values and 


standard deviations, respectively, from at least three independent ex- 


periments. 


For several S/R pairs cleavage activity was not detected 


(n.d.; /c obs 


< 0.001 min" 1 ) at [R] < 5 uM 





multiple-substrate kinetic procedure in which one enzyme 
is simultaneously incubated with a mixture of substrates 
(29). Single-turnover conditions were chosen for our 
studies because the enzyme is in excess with respect to 
all substrates, and there should be no competition 
between the various substrates. Given the different sizes 
of substrates investigated (Figure IB), the disappearance 
of any given substrate is easily monitored by denaturing 
gel electrophoresis (Figure 3). The percentage of remain- 
ing substrate can be reliably estimated from the ratio of 
substrate remaining at time t to that at time zero provided 
that normalization of input RNA in each lane can be 
achieved. For normalization, we initially added a 
non-cleavable DNA control in the cleavage reaction, but 
later found that quantification is more reliable using the 
non-cleavable 5_ 2 b P RNA that gives a better-defined band 
on the gel (Figure 3). 

The multiple-substrate kinetic procedure was first per- 
formed with R 0 , revealing essentially identical k cat /K M 
values to those obtained with the single-substrate kinetic 
method (Table 1). These results confirm the reliability of 
the multiple-substrate procedure and, thus, similar studies 
were conducted with the variant ribozymes, except for 
-R+2b P because its helix-length complementary substrate 
5_2b P is not cleavable. Interestingly, substrate cleavage is 
observed for several S/R pairs, with many having similar 
k C at/K M values as the S Q /R Q pair (Table 2). To verify that 
substrate cleavage for all functional S/R pairs yields the 
same product, the reaction products were analyzed by 
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Figure 3. Simultaneous kinetic characterization of R_3 bp with multiple 
substrates. The cleavage reactions of 5'- 32 P-labeled substrates (S) by 
R-3bp ( at lOOnM) were monitored by denaturing gel electrophoresis. 
A DNA hairpin was used as a normalization control (C). 



Table 2. Summary of multiple-substrate kinetic data' 1 



4bp 



M 



3bp 



Ratio fecat/^v 



-2bp 



Ratio k c JK, 



M 



- 1 bp 



Ratio k c JK t 



Ibp 



Ratio k CM jK M Ratio k^JK] 



M 



Ratio 



So 

S+lbp 

>^+3bp 
^+4bp 



n.d. 
n.d. 
n.d. 

0.07 ± 0.01 
0.35 ± 0.04 
0.040 ± 0.004 



0.0064 

0.032 

0.0036 



n.d. n.d. 0.11 ± 0.02 0.0095 5.9 ± 0.7 0.54 0.13 ± 0.02 0.012 

n.d. 0.11 ± 0.02 0.01 12.0 ± 0.9 1.1 11 ±2 1 0.019 ± 0.002 0.0017 

0.051 ± 0.006 0.0046 10 ± 1 0.87 14 ± 2 1.3 2.7 ± 0.4 0.25 0.013 ± 0.002 0.0012 

3.1 ± 0.2 0.28 7.4 ± 0.8 0.67 1.9 ± 0.3 0.17 n.d. n.d. 

0.66 ± 0.09 0.064 0.20 ± 0.05 0.018 n.d. n.d. n.d. 

0.020 ± 0.003 0.0018 n.d. n.d. n.d. n.d. 



a The k cal /K M and their errors (in min^'uM -1 ) are average values and standard deviations, respectively, from at least three independent experiments. 
The ratios correspond to (k CM /K M )/(k CM /K M )o, where (k CAl jK M )o is the value obtained using R 0 and So. Data for high-activity S/R pairs with 
ratios > 0.25 are highlighted in grey. For several S/R pairs cleavage activity was not detected (n.d.; /c 0 b S < 0.001 min -1 ) at [R]<5uM. 
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R+\b-p, Sq/Ro, S + ib p /R-n, p , S 
and S+3b p /i?-4bp- However, k, 



1 bp/ R 
cat/^M 



comparing their migration with alkaline and Tl digests on 
denaturing gels. The results clearly demonstrate that 
cleavage occurs between G620 and A621 for all substrates 
cleaved by either R 0 or variant ribozymes (Supplementary 
Figure S2). Thus, the helix-length mutations investigated 
here have no effect on cleavage site selection. 

As expected, helix-length compensation is observed for 
all variant ribozymes, since the highest values of k C!it /K M 
are observed with the compensatory substrate or one that 
differs by a single base pair. More precisely, the most 
active S/R combinations for each ribozyme are: S_ib P / 

-2bp> S+2bp/^-3bp 

values similar to 

that of Sq/Rq (within 4-fold; shaded in Table 2) are 
observed only for four of the six ribozymes (R 0 , R-ib P , 
R-2b P and i?_ 3bp ). For the R +l b p and R- 4 b p ribozymes, 
the maximal k oaX /K M values are reduced significantly, ~80- 
and ~30-fold, respectively, compared to S 0 /R Q . 

Structural modeling of SLI/SLV complexes compatible 
with kinetic data 

Catalysis generally requires the precise positioning of 
chemically active groups within the active site, which in 
the VS ribozyme results from the precise docking of 
the cleavage site and A730 internal loops (10,19-27). 
Thus, it is likely that the cleavage site loop is presented 
in a favorable position for productive docking in the most 
active S/R combinations, but not in the inactive or poorly 
active S/R combinations. To explore how the relative pos- 
itioning of the cleavage site loop is affected by helix-length 
mutations, a structural modeling approach was used. We 
chose to model an SLI/SLV complex as opposed to 
models of complete S/R pairs to simplify the modeling 
procedure and limit the structural consequences of 
helix-length mutations. Furthermore, the SLI/SLV 
models allow us to examine the effect of helix-length vari- 
ations on the positioning of the scissile phosphate in SLI 
relative to a common base pair in SLV. This approach is 
consistent with our kinetic studies of S/R pairs in which 
ribozyme variants retain a common core sequence and 
interact with the free substrates primarily through the 1/ 
V kissing-loop interaction (Bouchard, P., Delfosse, V. and 
Legault, P., unpublished data). We assumed that realistic 
3D models would be compatible with the kinetic data, 
such that the relative positioning of the cleavage site 
loop in active S/R combinations would be distinct from 
that found with inactive S/R combinations. 

3D models of the SLI/SLV complexes were generated 
using a stepwise procedure (Supplementary Figure S3). 
First, several models of the I/V kissing-loop junction 
core were obtained using previously described constraints 
(18). Of these core models, 56 were then selected because 
each could be used to build a common-core group of 36 
sub-models that contain all combinations of SLI (S_ib p to 
S+4 bp) and SLV stems (R-4b P to i?+ib P ) and remain con- 
sistent with input structural constraints after energy mini- 
mization. For all selected sub-models (56 x 36), the 
interhelical angle and axial displacement between stems 
lb and V were computed and average values were 
determined for each common-core group. A wide range 



of axial displacements (c/i_v — 12-24 A), and interhelical 
angles (0t_v = 108° to 175°; Figure 4) are observed among 
the common-core groups, pointing to the extensive con- 
formational sampling of the modeling procedure. 

We hypothesized that for common-core groups compat- 
ible with the kinetic data, the sub-models of the 
high-activity class p cat /^ M )/(£ C atAKM)o > 0.25); Table 2] 
should be significantly different than those of the 
low-activity class [(k caX /K M )/(k c JK M ) 0 < 0.01); Table 2] 
of S/R pairs. In order to compare the structures of the 
sub-models, pairwise rmsd's were calculated between the 
sub-models of each common-core group and their refer- 
ence S Q /R 0 sub-model. For each common-core group, the 
rmsd distributions for the high-activity and low-activity 
classes of S/R pairs were compared using a Welch's t- 
test (36). Interestingly, we observe a good correlation 
between /"-values and interhelical angles (Figure 4), 
where common-core groups with average interhelical 
angles (#i_v) closer to 180° show smaller P- values and 
are thus more likely to be consistent with the kinetic 
data. Group 13 (t9 T _v = 166.0° ± 3.6) was selected for 
structural analysis, but similar conclusions are obtained 
for other common-core groups with /"-values smaller 
than the a threshold value of 0.02 (group 12: 
e r _ v = 165.4° ± 3.5° and group 20: 9,_ v = 171.4° ± 4.5°). 

The 3D structure of the S 0 /R 0 sub-model from the 
selected group reveals that, although the helical axes of 
stems lb and V are more or less colinear, the two stems 
are not coaxial (Figure 5A). Rather, there is a significant 
perpendicular displacement (d T _v ~ 20 A) between these 
axes that results from the structure of the kissing-loop 
junction. The spatial relation between these two axes is 
reflected in the structure of variant S/R pairs, as illustrated 
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Figure 4. Identification of SLI/SLV models compatible with the kinetic 
data. Plot of the P-value (Welch's Mest) against the mean interhelical 
angle. The /"-value was obtained from a Welch's /-test that 
compared the rmsd distributions of the high-activity and low-activity 
classes of S/R pairs for a given common core group. The mean 
interhelical angle was obtained by computing the average interhelical 
angles of the 36 S/R pairs for each common-core group. All 
common-core groups showing a ,P-value < 0.02 are numbered. 
Standard deviations on the interhelical angles vary between 2° and 
7°, but were omitted from the graph for clarity. 
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Figure 5. Model of an SLI/SLV complex compatible the kinetic 
data (common-core group 13). (A) Stick representation of the 
So/Rq sub-model. (B) Cartoon representation of a superposition of 
the five variant S/R a sub-models on the S a /R 0 sub-model. The 
black arrow points at the scissile phosphate of Sq/Ro- The scissile 
phosphates of the following substrates are shown from bottom to 
top: S_i, S 0 , S+u 5+2, S +i , 5+ 4 . (C) Top-down and (D) front views 
of the superposition of the most catalytically efficient sub-models of 
each ribozyme. The following S/R pairs are shown from left to right: 
S^/R+u So/Rq, S +1 /R_ u S +1 /R- 2 , S +2 /H-u S +3 /R_ 4 . In (B), (C) and 
(D), the sub-models were superposed to the So/Ro sub-model based on 
heavy atom alignment of the first base pair of SLV (residues 689 and 
707 in So/Ro)- SLI and SLV are represented in deep blue and orange, 
respectively. Helical axes are represented as red rods and the phos- 
phorus atom at the cleavage site is shown as a sphere. The phosphorus 
atoms are color-coded according to the catalytic efficiency of the 
sub-models {[{k c .JK M )/(k cll JK M ) > 0.25]: deep orange; [0.25 > (£: cat / 
K M )l(k c JK M )>0M]: light orange and [(k c JK M )l(k c JK M ) < 0.01]: 
white}. 



in Figure 5B-D. The superposition of all S/R pairs con- 
taining R 0 shows that the scissile phosphates of the 
high-activity substrates o (deep orange spheres in 
Figure 5B) cover a ~7.5 A distance on one side of stem 
lb, whereas those of low-activity substrates (white spheres 
in Figure 5B) rotate around the helical axis towards a 
different side of the helix. Thus, this superposition indi- 
cates that limited helical displacements of the scissile phos- 
phate are compatible with productive docking of the 
substrate internal loop with the catalytic domain. 

To better understand the helix-length compensation 
observed between selected S/R pairs, we superposed the 
S/R pairs that produce maximal cleavage activity for 
each ribozyme (Figure 5C and D). It is clear from both 
the top-down (Figure 5C) and front views (Figure 5D) of 
this superposition that the scissile phosphates are dis- 
tributed along the arc of a circle. Those of the high- 
activity S/R pairs (deep orange in Figure 5C and D) span 
a chord length of ~22 A. This superposition suggests that 
large, but restricted displacements of the scissile phosphate 
along this arc are compatible with productive docking of 
the substrate internal loop with the catalytic domain. 



DISCUSSION 

In this study, we explored an unusual type of compensa- 
tory mutations in RNA, termed helix-length compensa- 
tions, using the VS ribozyme as a model system. VS 
substrate and ribozyme variants were prepared in which 
the number of base pairs in stems lb and V were either 
reduced or increased compared to the reference Sq/Ro pair. 
Several S/R combinations were found to be nearly as 
active as the reference pair, with precise and efficient 
cleavage obtained for substrate derivatives that contain 
3-6 bp in stem lb. Thus, this work expands the substrate 
specificity of the VS ribozyme architecture, while revealing 
its adaptability for efficient substrate cleavage. Two inter- 
esting aspects of this adaptability are limited substrate 
promiscuity with respect to stem lb length and 
helix-length compensation between stems lb and V. 

Substrate promiscuity in the VS ribozyme 

Substrate cleavage by R 0 occurs most efficiently with a 
stem lb containing 3-5 potential base pairs, whereas the 
catalytic efficiency for substrates with 6-8 bp in stem lb is 
reduced at least 50 000-fold compared to So- Thus, there is 
limited substrate promiscuity with respect to stem lb 
length for cleavage by R 0 , and comparable results are 
obtained with R_i and R_ 2 . A similar phenomenon has 
previously been observed for the 5'-splicing reaction of the 
group I intron ribozyme, where the cleavage site coincides 
with a G-U wobble pair in the first stem-loop (PI) of the 
intron (39). Cleavage at the correct site was observed when 
the G-U wobble pair was inserted within a 3-bp window 
of the PI stem (39). Although the cleavage mechanisms of 
the group I intron and VS ribozymes differ significantly, 
both require productive docking of a hairpin substrate 
with the catalytic domain. Thus, for both ribozymes, 
helical displacement of the scissile phosphate within a 
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window equivalent to 3 bp is compatible with productive 
docking and cleavage at the correct site. 

Helix-length compensation between stems lb and V 

We initially hypothesized that the reduced cleavage 
activity of defective stem lb variants by R Q could be 
restored by adjusting the number of base pairs within 
stem V, such that the total number of base pairs within 
the two stems would be preserved. Examination of the 
kinetic data indicates that there is, as predicted, 
helix-length compensation between stems lb and V. 
Indeed, substrates for which cleavage is not detected 
with Rq (5+2 bp, 5+3 bp, 5+4 bp) are cleaved by a ribozyme 
variant. However, the cleavage activities of the compensa- 
tory S/R pairs are not all restored to the reference level. In 
agreement with these results, 3D modeling of the I/V 
kissing-loop complex indicates that the global orientation 
of stems lb and V significantly affects the position of the 
scissile phosphate in the variant S/R pairs. 

An unexpected aspect of this helix-length compensation 
study is that it helps provide global structural information 
on the I/V kissing-loop interaction. From an ensemble of 
SLI/SLV models that display a wide range of interhelical 
angles (108° <0i_v< 175°) and axial displacements 
(di_v = 12-24 A), three models were selected that best fit 
the kinetic data. These selected models display a much 
smaller range of interhelical angles and axial displace- 
ments (0i _ v = 165°-171° and d^ v = 18-21 A), with near 
colinearity of their helical axes. By comparison, our pre- 
viously reported model of the SLI/SLV complex (18) 
exhibits interhelical parameters (0i_v = 106° and 
di_y = 16 A) that are somewhat compatible with the 
ensemble of models, but not with our selected models. 
In contrast, the approximate colinearity of stems lb and 
V is consistent with a 3D model of the VS ribozyme 
derived from biochemical and mutational data (14,28), 
however sufficient constraints were not available to pre- 
cisely define the interhelical angle between these two 
stems nor to rule out alternative models (Collins, R.A., 
personal communication). Similarly, the global model of 
the VS ribozyme based on fluorescence resonance energy 
transfer data does not incorporate sufficient constraints, 
particularly for the SLI/SLV interaction, to allow a fair 
comparison with our selected models (15,16). 
Interestingly, the model of the complete VS ribozyme 
derived from small-angle X-ray scattering (SAXS) shows 
an interhelical angle close to that of our selected models 
[0i_ v =154° and = 25 A; (17)]. This similarity 

between our selected models and the SAXS model is par- 
ticularly surprising given that the folding of the complete 
ribozyme was not accounted for in our modeling studies. 
In addition, the SAXS model includes helix VII that is not 
present in the VS ribozyme used for our kinetic study. The 
small difference in axial displacement between our selected 
models and the SAXS model likely reflects structural dif- 
ferences at the kissing-loop junction. Nevertheless, this 
comparison with the SAXS model suggests that the com- 
bination of kinetic data from helix-length variants with 
3D modeling represents a potentially powerful approach 
to extract interhelical parameters. 



Adaptability of the VS ribozyme architecture 
for substrate cleavage 

Given the intrinsic adaptability of the VS ribozymes archi- 
tecture for substrate cleavage, the modeling results give an 
appreciation of the level of structural adaptation that is 
likely at stake for productive docking of the SLI cleavage 
site internal loop with the A730 loop of SLVI. However, 
these models do not explain how high-activity S/R pairs 
move relative to the wild-type S/R pair to allow formation 
of the active site and efficient cleavage. 

An emerging view in RNA structure formation is that 
tertiary interactions form as a result of conformational 
sampling of helical domains within a distribution of con- 
formations that is limited, in part, by the inherent flexibil- 
ity of helical connectivities [for recent reviews see (40,41)]. 
It is likely that in the VS ribozyme, flexible elements such 
as its bulges, internal loops, three-way junctions and even 
its I/V kissing-loop junction (42,43) allow conformational 
sampling of helical domains, including SLI and SLVI, 
through which a well-defined active site can emerge. 
According to this view, the ability to sample the active 
conformation may be conserved in high-activity S/R 
pairs but not in those with lower activity. Future investi- 
gations are needed to help identify the dynamic features 
responsible for the conformational adaptability of the VS 
ribozyme and determine if a similar adaptability exists in 
other functional RNAs. 

Prospects for RNA research 

Helix-length compensation represents an alternative 
approach for investigating the sequence and structural 
repertoire of functional RNAs that could be applied to 
other helix pairs. In principle, helix-length compensation 
could be observed for helices that are more or less coaxial- 
ly stacked (1-6) or those that are not coaxial but nearly 
colinear as observed for the VS ribozyme. Although 
helix-length compensations have been previously 
identified by comparative sequence analysis (1-6), such 
analysis may not identify all possible helix-length 
mutants. Indeed, the genetic mechanism through which 
such complex mutation pattern evolves likely requires 
multiple mutational events with an overall lower probabil- 
ity than simple covariation of base-pairing residues. 
Moreover, sequence searches may not always take into 
account the possibility for helix-length compensation. 
Systematic helix-length compensation studies could help 
improve the consensus sequence and structural model 
for a given functional RNA. Subsequently, revised RNA 
consensus sequence could be used for bioinformatics 
database search of additional representatives of known 
functional RNAs (44^17). Helix-length compensation 
studies could also help reveal the inherent adaptability 
of other complex RNA architectures and be useful for 
the design of RNAs with novel functions. 



SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online: 
Supplementary Figures SI -S3. 
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